Financial Data Resampling for Machine Learning Based Trading by Tomé Almeida Borges & Rui Neves

Financial Data Resampling for Machine Learning Based Trading by Tomé Almeida Borges & Rui Neves

Author:Tomé Almeida Borges & Rui Neves [Borges, Tomé Almeida & Neves, Rui]
Language: eng
Format: epub
ISBN: 9783030683795


Amount: The purpose of this type of rearrangement is to sample data according to a fixed variation threshold rather than to a fixed percentage or time period. This process is identical to the Percentage rearrangement but is even simpler. Instead of calculating the percentual first order difference between closes, simply the actual first order difference series of the closing values is calculated. As was mentioned in Sect. 2.​3.​1, the first order difference of the close series corresponds to the changes from the close of an actual data point with its immediate previous data point and has the added benefit of achieving (or approaching) stationarity.

All procedures for calculating the alternative cumulative sum column, for assignment of new group identifiers and sample grouping are identically followed throughout the remainder of this resampling process. The only exception is the fixed threshold and how it is determined. In this case, a fixed price value is used as threshold instead. This fixed value corresponds to a fixed closing price variation of the base currency expressed in units of quote currency. To obtain this fixed value, firstly, the sum of absolute differences between closes is calculated, i.e., Sum(abs(differencing(closeprices))). The value obtained is divided by the total number of data points obtained in the percentage grouping. This procedure guarantees that the quantity of data points obtained is similar to the quantity obtained in percentage grouping. Nonetheless it is unlikely that the final amount of samples obtained correspond precisely to the amount of samples obtained in the first rearrangement. These two types of resampling procedures group samples according to different variations which may phase themselves differently resulting in less groups during the customized cumulative sum group identifier assignment process and consequently less samples in the final dataset. Anyhow, this is not a frequent occurrence and the total sample difference between this rearrangement and the percentage rearrangement is never substantial (it is never larger than 1% of the total amount of samples of each rearrangement). In the end, likewise to the percentage rearrangement, the same amount of closing price action, is approximately the identical for each candlestick.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.